Back

Biology Methods and Protocols

19 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Predictive Value of Blood Tests in Postoperative Delirium for Abdominal Surgery Patients
#1 (1.8%)
Show abstract

BackgroundPostoperative delirium is a common complication in surgical patients, and is associated with a multitude of negative outcomes, including mortality, dementia, and increased healthcare costs. Therefore, a better understanding of what factors contribute to postoperative delirium, especially those that can be easily obtained, is important. MethodsWe conducted a retrospective cohort study using patients from the Medical Information Mart for Intensive Care (MIMIC)-IV database. Adult patient...

2
DBT-2026, a de-identified publicly available dataset of digital breast tomosynthesis exams with ground truth biopsies
2026-03-04 radiology and imaging 10.64898/2026.03.03.25337924
Top 0.2% (1.5%)
Show abstract

Digital breast tomosynthesis (DBT) is a powerful imaging modality that allows for improved lesion visibility, characterization, and localization compared to conventional two-dimensional digital mammography. DBT has been increasingly adopted in screening and diagnostic settings globally, particularly for women with dense breast tissue where tissue overlap presents a significant diagnostic challenge. Here we describe DBT-2026, a real world imaging dataset with 558 DBT exams from 558 patients with ...

3
CardioPulmoNet: Modeling Cardiopulmonary Dynamics for Histopathological Diagnosis
2026-02-20 health informatics 10.64898/2026.02.19.26346620
Top 0.2% (1.5%)
Show abstract

ObjectiveThis study investigates whether incorporating physiological coupling concepts into neural network design can support stable and interpretable feature learning for histopathological image classification under limited data conditions. MethodsA physiologically inspired architecture, termed CardioPulmoNet, is introduced to model interacting feature streams analogous to pulmonary ventilation and cardiac perfusion. Local and global tissue features are integrated through bidirectional multi-h...

4
Large-Scale Pharmacokinetic Reconstruction of Propofol Effect-Site Concentrations: Quantifying the Divergence between Clinical Titration and Age-Dependent Pharmacodynamic Requirements
2026-03-05 anesthesia 10.64898/2026.03.04.26347547
Top 0.2% (1.5%)
Show abstract

BackgroundPropofol dosing guidelines recommend age-based reductions because hypnotic sensitivity increases in older adults. Most real-world evaluations of induction practice, however, have relied on total weight-normalized dose (mg/kg) rather than estimating cerebral exposure using pharmacokinetic models. Because age-related pharmacokinetic changes alter the relationship between administered dose and peak effect-site concentration (Ce,max), mg/kg surrogates may misrepresent true age-dependent ex...

5
Large Language Models Readability Classification: A Variability Analysis of Sources and Metrics
2026-03-02 public and global health 10.64898/2026.02.20.26346638
Top 0.4% (1.3%)
Show abstract

AbstractAccurate health information is ineffective if patients cannot understand it. Large Language Model (LLM) health research values veridical precision; however, linguistic accessibility remains an under-examined component of output quality and usability. This study investigated two sources of variability in readability classification: differences across LLM systems and across readability metrics. The analysis tested 1,120 data points from seven systems in English and Portuguese, comparing ba...

6
Deep Learning-Based Missing Value Imputation for Heart Failure Data from MIMIC-III: A Comparative Study of DAE, SAITS, and MICE+LightGBM
2026-02-11 health systems and quality improvement 10.64898/2026.02.10.26345979
Top 0.4% (1.3%)
Show abstract

BackgroundElectronic Health Records(EHR) are very crucial for Clinical Decision Support Systems and for proper care to be delivered to ICU heart failure patients, there is often missing data due to monitoring device errors thus the need for robust imputation methodologies. ObjectiveTo compare and evaluate three different methodologies for imputing missing data for heart failure patients from the MIMIC-III database: Denoising Autoencoder (DAE), Self-Attention Imputation for Time Series (SAITS), ...

7
Can AI Match Human Experts? Evaluating LLM-Generated Feedback on Resident Scholarly Projects
2026-03-04 medical education 10.64898/2026.03.04.26346878
Top 0.5% (1.3%)
Show abstract

BackgroundDelivering timely, high-quality feedback on resident scholarly projects is labour-intensive, especially in large programmes. We developed an AI-assisted evaluation system, powered by the open-weight LLaMA-3.1 large-language model (LLM), to generate formative feedback on Family Medicine residents scholarly projects and compared its performance with expert human evaluators. MethodsWe evaluated whether the AI-generated feedback achieves comparable quality to expert feedback. The tool ing...

8
Discordant Care as a Computable Phenotype: Real-Time Detection of Routine Protocol Completion Without Cognitive Patient Engagement Predicts Hospital Mortality in the ICU"
2026-02-26 intensive care and critical care medicine 10.64898/2026.02.24.26347021
Top 0.6% (1.2%)
Show abstract

BackgroundQuality measurement in intensive care emphasizes task completion--whether assessments were documented and protocols followed. Electronic health record (EHR) systems capture these signals in real time, yet current metrics cannot distinguish task completion from cognitive clinical engagement. A prior analysis demonstrated that omission of orientation assessment predicted a 4.29-fold increase in hospital mortality among low-acuity ICU patients [1]. Whether combining this marker with routi...

9
Federated penalized piecewise exponential model for horizontally distributed survival data: FedPPEM
2026-02-12 health informatics 10.64898/2026.02.11.26346054
Top 0.6% (1.2%)
Show abstract

Cox proportional hazard regressions are frequently employed to develop prognostic models for time-to-event data, considering both patient-specific and disease-specific characteristics. In high-dimensional clinical modeling, these biological features can exhibit high collinearity due to inter-feature relationships, potentially causing instability and numerical issues during estimation without regularization. For rare diseases such as acute myeloid leukemia (AML), the sparsity and scarcity of data...

10
Graph-Augmented Retrieval for Digital Evidence-Based Medical Synthesis: A Proof-of-Concept Study on Topology-Aware Mechanistic Narrative Generation
2026-02-19 health systems and quality improvement 10.64898/2026.02.18.26346545
Top 0.6% (1.1%)
Show abstract

BackgroundRetrieval-augmented generation (RAG) frameworks such as RAPID [1] have demonstrated that staged planning and retrieval grounding improve long-form text generation. However, most implementations remain similarity-driven and open-domain, lacking the epistemic safeguards required for biomedical synthesis, where mechanistic completeness, temporal governance, traceability, and explicit gap classification are essential. ObjectiveTo develop and evaluate a topology-aware, graph-augmented retr...

11
Development and Validation of Regression-based Neuropsychological Testing Norms for Peruvian adults to detect HIV-associated Neurocognitive Impairment
2026-02-11 neurology 10.64898/2026.02.09.26345550
Top 0.8% (1.0%)
Show abstract

IntroductionNeurocognitive impairment (NCI) remains common among people living with HIV (PWH), particularly in low- and middle-income countries where accurate diagnostic tools are limited. In Peru, the lack of locally validated neuropsychological (NP) normative data in Spanish poses a major barrier to diagnosing HIV-associated NCI, especially among PWH who develop NCI at younger ages. This study aimed to develop regression-based NP norms for young and middle-aged Spanish-speaking adults in Lima,...

12
Lesion-Centric Latent Phenotypes from Segmentation Encoders for Breast Ultrasound Interpretability
2026-03-06 radiology and imaging 10.64898/2026.03.06.26347800
Top 0.8% (1.0%)
Show abstract

We propose a lesion-centric phenotype learning pipeline for interpretable breast ultrasound (BUS). Predicted lesion masks are used for mask-weighted pooling of segmentation-encoder latents, producing compact embeddings that suppress background influence; a lightweight calibration step improves cross-dataset consistency. We cluster embeddings to discover latent phenotypes and relate phenotype structure to morphology descriptors (compactness, boundary sharpness). On BUSI and BUS-UCLM with external...

13
PrivateBoost: Privacy-Preserving Federated Gradient Boosting for Cross-Device Medical Data
2026-02-12 health informatics 10.64898/2026.02.10.26345891
Top 0.8% (1.0%)
Show abstract

Cross-device medical federated learning--where individual patients participate directly rather than institutions--poses a unique challenge: each client holds only a few samples, often just one (e.g., a single diagnostic record), leaving insufficient local data for gradient computation. Existing approaches, such as Secure Aggregation, require client-to-client coordination impractical for intermittently available mobile devices, while homomorphic encryption introduces substantial computational ove...

14
Diagnostic Accuracy of Artificial Intelligence for Arrhythmia Detection Using the 12-Lead Electrocardiogram: A Systematic Review and Meta-Analysis
2026-02-11 cardiovascular medicine 10.64898/2026.02.06.26345251
Top 0.9% (1.0%)
Show abstract

BackgroundArtificial intelligence (AI) has emerged as a promising tool for interpreting 12-lead electrocardiograms (ECGs), with the potential to enhance diagnostic accuracy for arrhythmia detection. However, published studies vary widely in methodology and validation strategy, warranting a quantitative synthesis of diagnostic performance. MethodsA systematic review and meta-analysis was conducted according to the PRISMA-DTA 2018 guidelines and registered in PROSPERO (CRD420251027264). Searches ...

15
Comprehensive Evaluation of Associations between Lifestyle Factors and Multiple Epigenetic Aging Indicators in the Japanese Population: A cross-sectional study
2026-02-09 epidemiology 10.64898/2026.02.07.26345813
Top 1.0% (0.9%)
Show abstract

BackgroundEpigenetic clocks based on DNA methylation (DNAm) provide quantitative indicators of biological aging. However, the extent to which diverse lifestyle factors influence DNAm-based aging measures remains unclear, especially in Japanese populations. We aimed to evaluate the associations between 52 lifestyle-related factors and multiple epigenetic aging indicators, including six DNAm ages (Horvath, Hannum, PhenoAge, GrimAge, GrimAge v2, and PCPhenoAge specific to Japanese Population), the ...

16
Biomedical Large Language Models and Prompt Engineering for Causality Assessment of Individual Case Safety Reports in Pharmacovigilance
2026-02-24 pharmacology and therapeutics 10.64898/2026.02.19.26346467
Top 1% (0.8%)
Show abstract

BackgroundBiomedical Large Language Models (LLMs) combined with prompt engineering offer domain-specific reasoning, yet their application to individual-level causality assessment remains unexplored. This study evaluated five combinations of biomedical LLMs, prompting strategies, and causality algorithms by comparing their agreement with two human expert evaluators. Research design and methodsA total of 150 Individual Case Safety Reports (ICSRs) were analyzed: 140 reports from Food and Drug Admi...

17
Fully Automated Systematic Review Generation via Large Language Models: Quality Assessment and Implications for Scientific Publishing
2026-02-23 health informatics 10.64898/2026.02.18.26346559
Top 1% (0.7%)
Show abstract

Large language models (LLMs) are increasingly transforming scientific workflows, yet their application to rigorous evidence synthesis remains underexplored. Through the execution of a single Python script, we present a fully automated pipeline leveraging the Claude API to generate systematic reviews from literature search through manuscript completion without human intervention. Our pipeline processes hundreds of papers through iterative API calls for inclusion evaluation, information extraction...

18
GEN-KnowRD: Reframing AI for Rare Disease Recognition
2026-03-03 health informatics 10.64898/2026.03.02.26347469
Top 1% (0.7%)
Show abstract

Rare diseases affect over 300 million people worldwide, yet patients often endure years-long diagnostic delays that limit timely intervention and trial opportunities. Computational rare disease recognition (RDR) remains constrained by knowledge resources that are often incomplete, heterogeneous, and dependent on extensive multi-disciplinary expert curation that cannot scale. Large language models (LLMs) applied directly for end-to-end diagnosis or disease discrimination face similar knowledge bo...

19
Evaluating a Locally Deployed 20-Billion Parameter Large Language Model for Automated Abstract Screening in Systematic Reviews
2026-03-04 health informatics 10.64898/2026.03.04.26347506
Top 1% (0.7%)
Show abstract

BackgroundSystematic reviews (SRs) are essential for evidence-based medicine but require extensive time and resources for abstract screening. Large language models (LLMs) offer potential for automating this process, yet concerns about data privacy, intellectual property protection, and reproducibility limit the use of cloud-based solutions in research settings. ObjectiveTo evaluate the performance of a locally deployed 20-billion parameter LLM for automated abstract screening in systematic revi...

20
NIR autofluorescence allows for pituitary gland detection during surgery: the first evidence from microscopic studies and in vivo measurements
Top 1% (0.7%)
Show abstract

A critical challenge in endocrine neurosurgery is intraoperative discrimination between normal pituitary tissue and pituitary neuroendocrine tumors (PitNETs). Suggesting the universal persistence of near-infrared autofluorescence (NIRAF) in endocrine organs and inspired by routine clinical use of NIRAF for parathyroid gland identification, we discovered that pituitary NIRAF can be employed for label-free transsphenoidal surgery guidance. Ex vivo confocal spectral imaging of 33 specimens identifi...